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ABSTRACT 

Concurrency  Control  Mechanisms  (CCMs)  in  multi-user  database  systems 
are  assigned  the  task  of  regulating  the  concurrent  access  to  shared  data 
so  that  basic  integrity  constraints  of  the  database  are  preserved. 
Although  the  theory'  of  COM  correctness  has  achieved  a  degree  of  mathemati- 
cal maturity,  the  theori"  of  quantitative  performance  evaluation  of  the 
various  CCKs  is  still  in  its  infancy.   This  stems  from  the  enormous  number 
of  CCMs,  the  complexity  of  the  algorithms,  the  lack  of  a  precise  descrip- 
tion in  some  standard  terminology  and  different  (and  sometimes  incomparable) 
assumptions  employed  about  the  underlying  environment. 

In  this  paper  we  propose  a  methodology  for  analyzing  the  performance 

of  CCMs.   The  methodology  is  based  on  paradigms  of  the  physical  sciences. 

.V3.'3ini  zi-   r.' 
Controlled  experiments  lead  to  hypotheses  about  system  behavior  and  then  a 


m.odel  is  proposed,  trying  to  explain  the  experimental  data.  The  model  is 
then  refined  by  comparing  its  predicted  results  to  results  of  new  experi- 
ments.  We  apply  this  methodology  to  dynamic  two-phase   locking    (2PL) . 

The  experimental  tools  used  are  detailed  simulation  programs.   Several 
interesting  trends  of  behavior  were  observed.   For  example,  the  mean  dead- 
lock rate  was  observed  to  be  logarithmic  in  the  ratio  of  transactions  in 
the  system  divided  by  total  number  of  database  items. 

We  also  propose  a  model  for  explaining  the  experimental  results,  using 
the  theory  of  the  occupancy  problems  and  Markov  processes.   The  model  cap- 
tures the  basic  trends  of  the  system  behavior  v;ith  a  high  degree  of  accuracy. 
Also,  several  variations  of  dynamic  2FI,  are  compared  and  tested. 
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1.   INTRODUCTION 

Concurrency  Control  Mechanisms  (CCMs)  in  multi-user  database  systems 
are  assigned  the  task  of  regulating  the  concurrent  access  to  shared  data. 
A  concurrency  control  mechanism  operates  correctly  if  it  presents  a  con- 
sistent and  coherent  view  to  each  user;  one  way  to  express  that  mathemati- 
cally is  by  the  theory  of  senaZizcibiHty    [BG,  80]  . 

In  addition  to  correctness,  good  performance   determines  the  usefulness 
of  a  CCM.   The  word  performance  (applied  to  CCMs)  is  a  vague  term  and  has 
been  quantified  differently  by  various  authors.   While  the  studies  of 
correctness  of  CCMs  achieved  a  degree  of  mathematical  maturity,  the  theory 
of  CCM  performance  is  still  in  its  infancy. 

The  reason  for  this  stems  from  the  following: 

—  The  existence  of  an  enormous  number  of  CCMs,  which  tends  to  spread 
the  research  effort. 

—  The  difficulty  in  capturing  the  logical  characterization  of  CCMs  by 
traditional  queuing  theoretic  modelling.   (The  CCMs  seem  to  require  an 
analysis  of  queuing  systems  with  simultaneous  possession  of  many  servers 

by  a  single  customer,  or  of  systems  where  service  time  distributions  of  any 
seJTver  depend  on  the  state  of  another  server.   The  above  problems  are  not 
known  to  have  a  closed  form  solution  for  queuing  networks  of  general 
topology.) 

—  The  incomparability  of  assumptions  stated  by  various  authors  (and 
also  the  believability  of  these  assumptions,  difficult  to  test,  due  to  lack 
of  commercial  implementations  of  many  of  the  CCMs) . 
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—  Lack  of  clarity  of  what  exactly  each  researcher  did,  especially  in 
cases  of  simulation-based  studies. 

In  short,  CCM  performance  analysis  is  in  need  of  a  methodology.   This 
paper  is  an  attempt  to  provide  a  coherent  methodology  for  CCM  study.   We 
apply  this  methodology  in  studying  dynamic  two-phase  locking  (2PL)  which 
is  one  of  the  few  basic  Techniques  for  database  concurrency  control. 

Our  suggestion  is  to  follow  the  paradigms  of  physics.   An  oversimpli- 
fied version  of  this  paradigm  is  as,,  fpJ.Lowsj>r,^. 

—  A  phenomenon  is  observed  ie.^. ,  ,bpt..,g^s   expands;  deadlocks  in  a 
database  increase  with  number  of . transactions) . 

—  Experiments  are  performed  ^n^._,a.jCg-ap^pl-led  fashion   and  at  various 
ranges  to  obtain  reliable  data.  (J^^^.r ajij./5ipp^atus  (or  a  program)  is  care- 
fully constructed  and  tested.   For^tii^  gas  example,  this  could  be  an 
apparatus  measuring  gas  pressure  as^.  a- fwnction  of  temperature,  for  various 
gases  and  a  wide  range  of  temperatures..  For  a  CCM  this  could  be  a  simu- 
lation program  which  replicates,  in  detail,  the  function  of  the  2PL 
method.)   Experiments  should  be  repivdusible   by  other  researchers. 

—  Laws  are  conjectured  in  order  to  explain  the  observations.   These 
laws  are  combined  to  a  theory;  a  theory  usually  employs  a  model  of  the 
restricted  universe  of  the  experiments,  trying  to  capture  the  fundamental 
trends  of  behavior  of  the  system.   (E.g.,  gas  molecules  are  perfectly 


As  [BG,  80]  showed,  it  is  possible  to  analyze  every  concurrency  control 
problem  into  two  major  subproblems,  namely,  read-write  and  write-write 
synchronization.   Every  CCM  usually  includes  a  submethod  for  solving 
each  of  these  subproblems.   Current  research  indicates  that  only  a  few 
subalgorithms  are  possible.   Most  of  them  are  variations  of  two  basic 
techniques:   two-phase  locking  (2PL}  and  timestamp  ordering  (TO). 
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elastic  balls,  moving  in  the  container  at  random  directions.  Transactions 
behave  as  balls,  thrown  randomly  at  a  set  of  urns  which  represent  the  data 
items,  in  a  sophisticated  occupancy  problem.) 

—  The  theory  predicts  results  of  future  experiments.   These  experi- 
ments are  then  performed  and  the  theory  is  refined  (correction-prediction 
method) . 

The  effectiveness   of  a  theory  is  measured  by  its  ability  to  explain 
observed  phenomena  and  to  predict'' results' of  future  experiments.   When  a 
theory  disagrees  with  new  experiments,  when  it  is  usually  "patched"  to 
handle  exceptions,  this  is  a  good  sign  that  a  new  theory  is  needed.   Some- 
times the  new  theory  may  incorporalE'6"the''Old  as  a  practical  projection  to 
a  restricted  universe  (e.g.,  class i'cl.1  mfechanics  and  relativity). 

We  shall  treat  CCMs  as  physical  systems.   Intuitively,  this  is 
meaningful  when  there  are  large  quantities  of  users  and  data  and  a  rich 
variety  of  transaction  requests  to  make  any  brute  force  approach  absurd. 
The  CCM  should  be  niae ,    i.e.,  smooth,  nonabrupt,  slowly  changing.   Our 
results  show  that  2PL  is  indeed  a  nice  CCM.   We  will  carefully  carry  out 
the  physics  paradigm  to  2PL  [ES,  G,  L,  T,76}.    We  first  define  the 
observed  system,  called  the  coTe  system.      Variations  of  the  core  will 
correspond  to  variations  of  the  2PL  method. 

We  concentrate  here  on  dynamic   methods,  where  locks  are  requested 
over  the  lifetime  of  a  transaction.   Most  implementations  of  2PL  are 
dynamic.   By  contrast,  most  previous  research  concentrated  on  analysis 
of  static   2PL  methods  (except  [LN,  81],  [ShS,  81]). 

First  we  describe  the  experiments  and  tabulate  results.   Various 
trends  in  the  observation  data  are  then  studied.   A  simple  theory  is 
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proposed  to  explain  trends,  and  its  closeness  to  observations  is  tested. 
The  following  basic  trends  were  observed  in  a  large  number  of  con- 
trolled experiments: 

—  The  (mean)  number  of  locked  items  in  the  database  is  almost  linear 

in  the  number  of  transactions  for  small  ratios  of  number  of  transactions 

over  number  of  items.   It  becomes  a  slowly  growing  function  of  this  ratio 

for  medium  values  of  the  ratio  (close  to  a  logarithmic  function) .   Then 

it  stabilizes. 

\Lcborr.      .-■■. 

—  The  (mean)  number  of  deadlocks  within  a  time  interval  is  logarithmic 

Z'-'Z.  '--■'   32f)oc::q 
in  the  ratio  of  number  of  transactions  to  the  number  of  items  in  the  data- 

base. 

—  Sequential  and  parallel  implementations  of  2PL  behave  almost  the 
same  (for  uniform  data  access) . 

—  The  nximber  of  deadlocks  per  unit  time  and  the  number  of  locked  items 
have  small  oscillations  (timewise)  around  their  mean  values. 

—  The  rate  of  immediately  granted  lock  requests  is  fairly  stable  over 
time . 

Many  other  (less  important)  trends  were  also  discovered.   Our  theory 
successfully  explains  the  behavior  of  the  mean  number  of  locked  items  and 
the  mean  deadlock  rate.   It  is  based  on  simple  combinatorial  models  (balls 
and  urns)  treated  in  a  nonstandard  way,  and  on  simple  Markov  process 
theory.   It  predicts  values  with  less  than  15%  error  from  actual  observations. 

We  considered  mainly  closed   database  systems. (I.e.,  the  number  of 
transactions  is  kept  constant  over  time.   A  completion  triggers  an  arrival.) 
All  simulation  programs  were  written  in  Pascal  and  the  simulations  ran  in 
a  UnixA^AX  environment. 
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Our  effort  is  only  a  start.  Many  experimentally  observed  "patterns" 
of  2PL  behavior  are  still  to  be  explained.  The  surprising  fact  is  that  a 
theory  to  explain  2PL  behavior  seems  within  reach. 

The  paper  is  organized  as  follows.   Section  2  surveys  past  work. 
Section  3  defines  the  core  model  and  its  variations;  it  also  describes 
the  simulation  programs.   Section  4  discusses  the  experimental  results 
over  all  models.   In  Section  5  we  propose  a  theory  to  explain  the  core 
behavior  (by  using  a  balls  and  urns  model) .   Section  6  deals  with  vari- 
ations of  the  core;  a  Markov  process  is  proposed  to  explain  the  observed 
deadlock  rate  behavior.   Section  7  outlines  future  work. 

We  present  tables  and  plots  depicting  simulation  observations,  in  Appendices 
II  and  III.   Appendix  I  contains  proofs  of  some  claims  done  in  the  Analysis. 


^ 


-6- 


2.   PAST  WORK 

Garcia-Molina  [GM,  79]  compared  variants  of  centralized  2PL  to  the 
majority  consensus  algorithm  of  [Th,  78] ,  mostly  based  on  simulation 
studies.   He  concluded  that  centralized  2PL  outperforms  the  Thomas's 
algorithm  under  all  tested  conditions.   His  assumptions  included  pre- 
declaration  of  readsets  and  writesets  and.  awfully  redundant  database. 
The  analysis  was  limited  to  optimistic __situations  in  which  run- time 
conflict  never  occurred.   Ries  [R,  79^. analyzed  by  simulation  various 
forms  of  centralized  2PL  and  variatiqn^.^Qf  ^distributed  2PL.   For  all 
variations,  static  locking  was  ass,yroe4.5a^ij  ^^^  ;Simulations  were  set  up  in 
a  way  that  no  deadlocks  occurred  a;K3.-_runr- time, conflicts  were  very  rare. 
Consequently,  the  main  result  was  that  all  methods  had  similar  behavior. 
One  of  the  early  analytic  approaches  to  performance  evaluation  of  con- 
currency control  schemes  was  by  Gelenbe  and  Sevcik  [GeS,  78] .   Their 
analysis  was  mainly  on  timestamp  ordering  methods.   However,  their  assump- 
tions about  the  update  rules  are  different  from  the  current  implementa- 
tions.  They  defined  measures  of  "coherence"  (how  much  the  values  of 
various  copies  of  the  data  agree)  and  "promptness"  (how  up-to-date  the 
data  are) .   Their  results  were  based  on  assumptions  about  transmission 
delays  and  state  that  coherence  improves  rather  quickly  and  promptness 
drops  slowly  under  all  methods  considered.   Menasce  and  Nakanishi  [MN,  79] 
examined  "conflict-oriented"  methods  analytically  and  locking  by  doing 
simulations.   The  analysis  was  mostly  done  on  "certification  methods." 
(A  transaction  has  three  phases:   a  read  phase,  a  computation  phase,  and 
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a  test  and  update  phase.   It  can  be  restarted  only  at  the  end  of  the  last 
phase.   After  the  transaction  has  visited  all  the  places  it  wants,  its 
timestamp  is  compared  against  timestamps  obtained  during  the  read  phase.) 
Unfortunately,  their  assumptions  were  not  very  clear  and  the  model  analy- 
sis seemed  to  be  oversimplified.   D.  Potier  and  P.  Leblanc  [DL,  80] 
analyzed  the  case  of  centralized  2PL  with  i^'catic  looking  (a  transaction  is 
allowed  to  enter  the  system  only  i£  all  its  requested  locks  are  available) . 
The  probability  that  a  new  transaction  is  granted  its  locks,  given  k 
other  transactions  already  active,  was  found.   A  general  limitation  of  all 
these  analyses  on  2PL  is  that  only  static  looking  is  examined.   Although 
this  simplifies  the  analysis  with  respfect  to  time  dependence  of  state 
changes  of  the  system,  the  issue  6f "dffadldcks  and  restarts  will  be  avoided 
under  static  locking.   In  older  to  be  able  to  quantify  the  degradation  on 
performance  due  to  restarts  or  deadlocks,  it  is  important  to  study  dynamic 
locking.   In  addition,  since  most  real  systems  use  dynamic  locking,  the 
primary  objective  of  our  analysis  is  to  focus  on  the  performance  of 
dynamic  concurrency  control  systems  including  the  discussion  of  the  rate 
of  deadlocks. 

Lin  and  Nolte  [LN,  81]  performed  simulations  for  the  case  of  dynamic 
locking,  and  A.  Shum  and  P.  Spirakis  [ShS,  81]  provided  an  analysis  of 
dynamic  2PL  which  gave  worst  case  bounds  for  any  2PL  method.   They  con- 
sidered open  systems  and  provided  an  upper  bound  to  the  mean  deadlock  rate, 
proportional  to  the  mean  number  of  transactions  in  the  system.  The  results 
in  this  paper  subsume  the  previous  results  of  [ShS,  81]. 
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3.   THE  CORE  MODEL  AND  VARIATIONS 

3.1.   Two-Phase  Locking 

The  database  is  viewed  as  a  set  of  items    (files,  records,  etc.). 
Users  are  modelled  via  tvansaotions .      A  transaction  is  a  sequence  of 
operations.   We  consider  only  read   and  wrn.te   operations.   A  transaction 
maps  a  consistent  database  state  int-o.. another  consistent  database  state. 
Hence,  if  transactions  are  performed  Sfiri^ly»  they  all  see  consistent 
states  and  the  final  state  is  consistent. 

V?hen  transactions  operate  conciirrently ,  we  need  to  impose  restric- 
tions on  the  order  of  events  so  that  the  execution  sequence  will  be 
consistent  and  will  lead  to  a  consistent  final  state.   One  way  to  achieve 
this  is  by  effectively  keeping  the  execution  equivalent  to  a  serial  one 
(seri-atizable   executions)  .   The  two-phase  locking  method  (2PL)  synchro- 
nizes read  and  write  operations  by  explicitly  detecting  and  preventing 
conflicts.  ■  ' 

It  regulates  data  access  as  follows.   Before  reading  data  item  x, 
a  transaction  must  own  a  read  lock   on  x.   Before  writing  into  x,  it 
must  own  a  write   lock   on  x.   Different  transactions  carinot  hold  both  a 
readlock  and  a  writelock  on  an  item  x.   Also,  when  a  transaction  sur- 
renders ownership  of  a  lock,  it  may  not  obtain  any  additional  locks. 
Thus,  transactions  operate  in  a  two-phase  manner:   one  in  which  locks  are 
acquired  and  another  in  which  they  are'  released. 

The  2PL  controller  may  block   a  transaction  by  causing  it  to  wait  for 
an  unavailable  lock.   Due  to  this  blocking,  deadlocks  may  result. 


/^ 


-9- 


Deadlocks  could  either  be  prevented   or  detected   and  resolved.     Deadlock 
prevention  is  a  "cautious"  strategy  which  may  lead  to  a  large  number  of 
transaction  restarts.   Deadlock  detection  uses  the  wait- for  graph.      This 
is  a  directed  graph  whose  nodes  represent  transactions;  an  edge  from  node 
i  to  node  j  means  that  transaction  i  waits  for  transaction  j  to  release 
some  locks.   To  resolve  deadlocks  when  tiiey  actually  occur,  some  trans- 
actions are  restarted  in  order  to  break  the  deadlock  cycle.   We  shall 
treat  a  deadlock  detection  2PL  implementation  which  was  used  in  our  simu- 
lations (the  core  system  and  its  variants) . 

3.2.   The  Core  System         '  •.  .-'^-'^"■^-■^■' 

In  the  interest  of  experimental  reproducibility,  we  shall  describe 
in  detail  the  system  we  simulated.   The  core  system   is  a  particular 
dynamic  2PL  implementation.   The  system  is  closed,    i.e.,  the  number  of 
transactions  (N)  is  held  constant  at  all  times.   A  transaction's  completion 
is  considered  to  trigger  an  immediate  arrival  of  another  transaction.   We 
assume  that  no  transaction  tries  to  have  more  than  one  lock  on  a  parti- 
cular item.   Figure  3.1  describes  the  major  program  modules  of  the  core 
system  implementation. 

Figure  3.1(b)  shows  the  purpose  and  interaction  of  the  procedures 
which  maintain  the  dynamically  changing  wait-for  graph  (wfg) .   These 
procedures  are  used  repeatedly  by  the  components  of  the  system  of 
Figure  3.1(a)  which  implement  the  locking  policy.   An  edge  in  the  wfg  is 
considered  to  be  labeled  with  an  item  number,  i.e.,  an  item  on  which  one 
transaction  is  waiting  for  the  other.   Note  that  this  may  actually  result 
in  a  multi-graph  in  case  of  a  parallel  -implementation  of  2PL.   The  wfg  is 
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Figure  3.1.   Core  System  Simulation  Program  Structure 
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maintained  acyclic  at  all  times  by  the  deadlock  resolution  mechanism. 

Figure  3.1(a)  lists  procedures  in  boxes.   An  arrow  from  box  A  to 
box  B  means  that  procedure  A  may  call  procedure  B.   The  cycle  (glock, 
extinction,  advanceq)  is  of  special  interest.   It  indicates  that  a  call 
from  toplock  to  glock  may  result  in  "extincting"  a  transaction  (due  to 
deadlock) .   This  may  trigger  other  transactions  to  be  advanced  in  their 
waiting  queues  requesting  more  locks,  thereby  (possibly)  causing  other 
deadlocks  {secondary  deadlock   effect).   Thus,  one  lock  request  "from 
outside"  may  induce  substantial  changes  in  the  data  structures. 

A  data  item  has  3  entries :   (1)^  locktype    (with  possible  values 
tcnlookedj   readZocked,   uritelocked) ;    (2)  waitg:   a  (wait)  queue  of  entries 
of  the  form  <transaction,  locktype _>, ;.  (3)  holdg:   a  set  of  transactions 
which  hold  locks  on  that  item. 

We  now  detail  the  functionality  of  the  various  simulation  procedures. 
Denote  a  lock  request  by  transaction   j  on  item   i  in  mode  k  by  Lijk. 

Toplock ;   Receives  lock  requests  from  the  input  stream.   Calls 
glock  (Lijk) . 

Lockeasy;   Called  with  Lijk  as  argument  by  glock.   It  returns 
'granted'  or  'postponed' .   If  i  is  unlocked  then  its  locktype  is  changed 
to  k,  j  is  put  on  i's  hold  queue  and  'granted'  is  returned.   If 
k = readlock  and  i's  locktype = readlock  and  i's  waitqueue  is  empty,  then  j 
is  added  to  i's  holdqueue,  and  lockeasy  returns  'granted'.   Otherwise, 
'postponed'  is  returned. 

Glock:   It  is  called  with  Lijk  arid  returns  'granted',  'postponed' 
or  'aborted' .   It  operates  in  four  stages. 
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Stage  1 

If  lockeasy  (Lijk)  =  'granted'  then  glock  responds  'granted'  and 
proceeds  to  stage  4. 

Stage  2 

Edges  (with  label  i)  from  j  to  each  t€i's  holdg  are  added,  one  at  a 
time,  to  indicate  that  t  blocks  j.   After  each  edge  is  added,  a  test  for 
cycles  in  the  wfg  is  performed;  in  case  of  a  cycle,  one  of  j  and  t  is 
selected  at  random  to  be  restarted.   (All  associated  edges  are  deleted 
from  the  wfg.)   If  all  edges  were  added  and  there  were  no  cycles  or  j  is 
extincted,  then  proceed  to  stage  3.   Else,  stage  2  is  repeated. 

Stage  3 

3.1.  If  j  is  extincted  then  re'Spond  'aborted'  and  go  to  stage  4. 

3.2.  If  lockeasy  (Lijk)  =  'granted'  then  respond  'granted'  and  remove 
edges  from  the  wfg  from  j  with  label  i;  go  to  stage  4.   (The  reason  lock- 
easy  is  called  again  is  that  changes  in  stage  2  might  "enable"  Lijk  to 
proceed. ) 

3.3.  Set  response  to  'postponed'  and  enqueue  j  on  i's  wait  queue. 

Stage  4 

If  j  holds  aut   number  of  locks  (cut  is  a  parameter  of  the  system) 
then  extinction (j)  is  performed.   We  then  consider  j  to  be  aompZeted. 
When  cut  =  °°,  no  completions  ever  take  place. 

Extinction :   It  handles  the  extinction  (due  to  deadlock  or  completion) 
of  active  (i.e.,  not-everywhere  blocked)  transactions.   It  is  called  with 
transaction  j  as  argument.   All  edges  in  the  wfg  associated  with   j   are 
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deleted.   All  items  are  scanned,  j  is  eliminated  from  all  wait  emd  hold 
queues.   Finally,  for  all  items  i  in  which  j  either  held  locks  or  was 
waiting,  advanceq(i)  is  performed,  since  j's  extinction  may  enable  other 
transactions  to  proceed. 

Advanceq ;   Advances  transaction  in  the  wait  queues.   It  is  called 
with  item  i.   Transactions  are  copied  from  i's  wait  queue  to  a  local 
queue  (scratchq) .   Edges  labelled  by  i  are  removed  from  the  wfg  and  i's 
wait  queue  is  now  empty.   One  by  one,  a  pair  <j,k>  is  dequeued  from 
scratchq.   If  j  is  not  extincted,  glock(Lijk)  is  performed. 

3.3.   Variations  of  the  Core  System 

We  considered  four  versions,  of  the  cort  system: 

Standard  Deterministic  (SD) 

Once  a  transaction  is  blocked  (on  soine  wait  queue) ,  no  lock  requests 
involving  this  transaction  are  generated  until  it  is  no  longer  on  a  wait 
queue.   That  is,  a  nonblocked  transaction  "moves"  sequentially  from  item 
to  item. 

Parallel  Deterministic  (PD) 

New  requests  may  be  generated  involving  a  blocked  transaction,  i.e., 
transactions  now  can  "split"  and  operate  in  parallel. 

Standard  Probabilistic  (SP) 

Same  as  SD  except  that  cut  is  used  in  a  probabilistic  sense  in  glock. 
At  stage  4,  completion  happens  with  probability  1/cut.   Also,  all  locks 

are  considered  to  be  writelocks. 

Parallel  Probabilistic  (PP) 

Same  as  PD,  with  completions  happening  with  probability  1/cut,  in  each 
new  item  access. 
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4.   RESULTS 


4.1.   The  Experiment 


The  input  stream  of  lock  requests  (of  the  form  Lijk)  is  randomly 

generated  as  follows:   A  nonblocked  transaction  j  (1  <  j  <  N)  is  randomly 

selected  out  of  the  n  nonblocked  (i.e.,  active)  transactions  (n  <  N) . 

An  item  on  which  the  transaction  neither  holds  a  lock  nor  waits  for  one 

is  then  (uniformly)  randomlv  selected.   The  lock  mode  (readlock  or  write- 

>V5  srii-  'Dl^:-. 

lock)  is  then  selected  randomly  as  well.   Thus,  a  lock  request  Lijk  is 
generated  and  toplock (Lijk)  is  invoked. 

This  process  is  repeated  for  totlocks   number  of  times.  TotZocks   is 
the  observation  interval,  usually  in  the  range  of  5,000-20,000  requests 
for  ranges  of  total  numbers  of  transactions  (N)  and  items  (M)  not 
exceeding  500.   Some  cases  of  M=1000,  N  <  M  and  totlocks  =  30,000  were 
also  tested.  \ 

At  all  times  the  simulation  program  is  keeping  the  following 
information. 

ni  =  number  of  locked  database  items. 

A 

nl  =  number  of  locks  held  by  all  transactions. 

nw  =  sum  of  lengths  of  all  waiting  queues  at  items. 

A 

nlj  =  number  of  locks  held  by  transaction  j. 

A 

nwj  =  number  of  times  transaction  j  appears  on  wait  queues. 
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At  each  interval  {repperiod)    the  program  reports  ni,  nl,  nw.  Within 
a  repperiod   number  of  lock  requests,  the  system  also  keeps  a  count  of 
ns  =  number  of  successful  (granted)  lock  requests. 

A 

nd  =  number  of  deadlocks. 

A 
nc  =  number  of  completions. 

These  quantities  are  re-initialized  to  0  every  repperiod  lock  requests. 

In  Table  A.l  we  present  the  output  from  one  experiment.   The 
quantities  la,  wa,  mxla  and  nact  denote  (respectively)  the  average  number  of 
locks  held  by  an  aborted  transaction,  the  average  number  of  wait  queue 
entries  associated  with  an  aborted  transaction,  the  maximum  number  of  locks 
held  by  an  aborted  transaction  and  the  average  (within  repperiod)  number  of 
nonblocked  (i.e.,  active)  transactions,  which  hold  at  least  one  lock. 

In  Figure  A.l  we  plot  the  behavior  of  the  quantities  ni,  nd,  ns  as 
time  (i.e.,  lock  requests)  progresses.   This  figure  is  a  typical  one.   VJe 
observe  that  ns  is  fairly  stable  and  that  nd  and  ni  oscillate  about  some 
average  value. 

Table  4.1  summarizes  the  experimental  results  for  SD  and  PD.   We 
observed  the  following  variations  for  PD  (for  some  of  the  parameters) . 

•  More  items  are  locked. 

o  More  deadlocks  are  observed.  ■ 

•  More  waiting  transactions  are  observed. 

•  Less  successes  (ns) . 

•  mxla  is  slightly  higher. 
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In  the  Appendix,  we  present  averages  for  the  observed  quantities  in 
tabular  form.   Each  experiment  lasted  from  5,000-20,000  lock  requests.   For 
each  value  of  cut,  27  experiments  were  carried  out.   In  all  our  tables, 
transactions  (N)  are  the  horizontal  axis  and  items  (M)  are  the  vertical  axis. 

Tables  and  diagrams  (Figures  4.2  and  4.3)  for  the  standard  deterministic 
(SD)  case  appear  in  the  Appendix,       ;.  , 

4.2.  Parallel  Deterministic  Case 

Tables  and  diagrams  for  PD  appear  in  the  Appendix. 

4.3.  Summary  for  5P  and  PP 

The  results  for  the  standard  probabilistic  (SP)  case  are  very  similar 
to  those  for  SD.   (See  Appendix  for  tables.)   For  PP  the  results  are  of 
the  same  general  nature,  although  there  are  some  logical  problems  (e.g., 
do  we  probabilistically  terminate  even  when  some  locks  are  still  waited 
for  by  a  transaction?).   See  the  Appendix  for  more  details. 

4.4.  General  Trends  (in  All  Systems) 

—  Measured  quantities  quickly  arrive  to  a  steady-state  level  (oscilla- 
tion around  some  clearly  defined  mean) . 

—  ni  varies  linearly  with  N  for  small  N,  then  it  varies  slowly  (almost 
logarithmically)  with  N^M  a.nd  finally  stabilizes  (to  M)  for  large  N. 

—  nd  (and  hence  the  mean  deadlock  rate)  is  a  very  slow  function  of  N/M 
(almost  logarithmic) . 
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—  ns  (and  hence  the  rate  of  successful  requests)  is  a  very  slow 
(logarithmic)  function  of  M/N.   It  has  the  smallest  oscillation  around 
its  mean  value  among  observed  quantities. 

—  The  parallel  and  sequential  data  access  modes  behave  the  same 
(sequential  data  access  is  slightly  more  efficient)  for  uniform  data 
access. 

—  Very  small  mean  number  of  active  transactions. 


\ 
\ 
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5.   A  MODEL 

In  this  section  we  present  a  model  for  explaining  the  behavior  of 
ni  in  our  simulations.   A  model  usually  serves  to  decrease  the  level  of 
detail  in  the  treatment  of  a  system,  while  still  capturing  its  basic 
laws  and  functions.   Hence,  in  designing  any   model  for  the  core  system, 
we  face  the  problem  of  choosing  the  relevant  "variables"  of  the  system 
to  keep  in  the  model.   Following  the  physics  paradigm,  we  overcome  the 
problem  of  over-information  by  ignoring  many  of  the  details  of  the 
actual  "logic"  of  the  system  and  try  i^^^irst  find  a  model  to  explain 
the  observed  quantities  ni,  nd  and  ns. 

The  experiments  showed  that  ni  approaches  a  mean  value  fairly 
quickly,  and  then  deadlocks,  completions  (if  any)  and  restarts  cause  ni 
to  oscillate  around  the  mean  with  oscillations  of  a  relatively  small 
magnitude.   This  is  possibly  due  to  the  fact  that  the  rate  of  deadlocks 
is  a  very  slow  function  of  N/M  and  that  only  a  few  (on  the  average) 
number  of  active  transactions  are  in  the  system,  even  for  small  values 
of  cut. 

Therefore,  we  shall  ignore  deadlocks  in  our  initial'  modelling  effort. 
In  Section  6,  we  provide  a  model  which  explains  the  observed  deadlock 
rate.   We  utilize  a  nonstandard  occupancy  problem  (for  a  treatment  of  the 
basic  problem,  see  [Fe,  66]  .   Balls  are  randomly  throv^n  into  a  set  of 
urns.   We  may  think  of  balls  as  transactions  and  of  urns  as  data  items. 
To  estimate  the  mean  number  of  items  locked  (deadlocks  aside),. we  perform 
process  BU  (Figure  5.1). 
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Figure  5.1.   Process  BU 

.1.   Start  with  a  set  of  N  balls  and  M  empty  urns  (i.e.,  N  active 
•transactions  and  all  database  items  unlocked) .   Let  n  =  N. 

.2.   If  n  =  0  then  stop. 

3.  -Generate  a  random  integer  i,   1  <  i  <  M. 

4.  n  -»-  n-  1    |*I.e.  ,  throw  a  ball*!  . 


5.   urn.  •*-   urn.  +1      I  *Md  ball  to  i   um*  I 
1      1         '  ' 


6.   If  urn.  =  1  then  n  <-  n  +  1    *A  transaction  succeeded  in 
locking  aji  unlocked  item.   It  can  now  remain  an  active  one* 


7.   goto  2. 

Intuitively,  process  BU  corresponds  to  the  SD  model  with  cut  =  •»  and 

without  deadlock  resolution.   When  BU  stops  (it  is  easy  to  see  that  it 

always  stops  after  at  most  N  +  M  throv;s)  ,  we  have  a  deadlocked  system.   As 

deadlock  resolution  only  introduces  relatively  small  perturbations  in  ni 

(by  the  experiments),  we  expect  BU  to  provide  the  mean  value  of  ni   (i.e., 

the  mean  number  of  urns  hit) . 

Theorem  5.1.   The  mean  number  of  items  locked  (i.e.,  urns  occupied) 
after  BU  stops  is  given  by  the  following  formula: 


ni 


=  ..,(.-(.-1)') 


vAiere  t  is  the  (mean)  time  for  the  process  to  stop,  and  is  given  by  the 
fixed  point  equation 


t  =  N  +  m(i  -  (l  -  i)  )  (**) 
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Theorem  5.2.    Equation  (**)  has  a  unique  positive  solution. 

Proofs  of  Theorems  5.1  and  5.2  are  given  in  Appendix  I. 
respectively. 

The  values  of  ni  from  equations  (*)  and  (**) ,  respectively,  agree  to 
a  surprising  degree  (error  less  than  15^)  with  the  results  of  the  system 
experiments,  for  cut  >  100.   See  Table  5.1. 

BU  has  a  natural  extension  to  SP  (or  to  SD  with  cut  <  <») .   The 
process  is  modified  to  capture  completion  with  probability  6  (=  1/cut) . 
The  new  process  is  described  in  Figure  5. '2. 

■  Figure  5.2.   Process  BU  (9) 

' .' ' .    \    \ 

/A  ,.  -  \  .  '^.-.-  . 

1.  Start  with  N  balls,  M  urns,  n  ■«-  N 

2.  If  n  =  0  then  stop 

3.  Generate  a  random  integer  i,   1  <  i  <  M 

4.  n  -«-  n-  1 


5 .   urn .  ■*•   urn .  +  1 
1      1 


6.  If  urn.  =  1  then 

a 

begin 

X  ■*-  random  number  in  [0,l3 

if  X  <  6  then       no-op 

else       n  -*-  n  +  1 
end 

7.  goto  2 
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Intuitively,  when  a  transaction  hits  an  empty  item  it  either  decides 
to  complete  (with  probability  6)  or  to  continue  being  active  (with 
probability  (1-6)). 

The  corresponding  theorems  are: 

Theorem  5.3.    The  mean  number  oJ.'  items  locked  when  BU(9)  stops  is 
given  by  the  following  formulas         '■- 

„i  .  m(i  -  (i  -  I)')  ("*) 

where  t  is  the  (mean)  time  for  BU(6)  to  finish  and  is  the  solution  of 
the  fixed  point  equation 

t  =  N  +  (l-O)Mfl  -  (l  -  i)  )  (****^ 

Kote  that  for  6  =  0  (i.e.,  cut  =  <»)  we  get  Theorem  5.1.   Also,  we  get 

Theorem  5.4.   Equation  (****)  has  a  unique  positive  solution. 

Proofs  cf  Theorems  5.3  and  5.4  are  given  in  Appendix  I,   Again, 
we  have  a  very  close  agreement  between  ni  (from  BU(9))  and  ni  (experi- 
mental) with  less  than  15%  error,  for  any  6.   (See  Appendix   II,  III). 
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5.2.   Nonempty  Urns  Distribution  (BU,  cut  =  °°) 

Let  p  (b,r)  =  probability  {at  timet  there  are  b  filled  urns  and  r 
balls  remain}.   Then 

Pt(^'^)  =  P^.l(b,r+1)  I  +  p^_^(b-l,r)(l  -  ^) 

with  "p  (N,M)  =  1       -/■-- 

o 

p  {b,r)  =  0     for  r  <  0 
p  (b,r)  =0     for  b  >  M  . 

We  eire  interested  in  the  distributions  of  (1)  the  time  t*  such  that 

r^^  =  0,   (2)  the  probability  p.*(b,^0)   for  0  <  b  <  M  which  we  call 
t  t* 

P(m)   where  m  =  M  -  b.  ".   ^  -  r"  •• 

Suppose  that  exactly  m  urns  are  empty  at  the  end.   Since  M-m  urns 
were  occupied,  exactly  M-m  new  balls  were  created  (in  addition  to  the 
initial  N)  .   Hence,  we  have  a  total  of  N  +  M-m  balls  placed  in  M  -  m  urns, 
so  that  each  of  these  urns  is  occupied.   From  classical  occupancy  prob- 
lems (Feller,  pp.  58,  92)  this  probability  is 

rr,    ^    .,N+M-m  /mN  "r"^  ,  ,  ,  v/M-mV^     ,  N+M-m  ' 
a(m)  =  M      11    2,   ("15  (  ^  l(M-m-v) 

^  '   v=0 

Hence   P(m)  =  X  •  a(m)   where  X  is  such  that 

M 

I   a(m)  =  i 
m=0        '^ 

VThen     N  -»■  »»,   we  know  from  sampling  theory    (Feller,   pp.    210-211)    thiat 

M 
mean(t*/m)   =^  M  log 


M  -  m  +  1 
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6.   RATES  OF  DEADLOCK  AITO  OF  SUCCESSFUL  LOCK  REQUESTS 

We  consider  a  model  for  explaining  the  mean  deadlock  rate  and  the 
mean  request  success  rate,  for  the  SD  and  SP  variations  of  the  case. 

6.1.   Theoretic  Estimation  of  nd  and  ns 

We  shall  assume  here  that  all  locks  are  writelocks.   Let  us  consider 
a  (modified)  wfg  where  an  edge  from  transaction  T  to  transaction  T' 
exists  if  and  only  if  either 

(1)  T  is  the  first  transaction  on  ttie  wait  queue  for  some  item  X 
and  T'   currently  owns  a  lock  on  XOpr',;  /-. 

(2)  T'  is  immediately  in  front  of  T  on  the  wait  queue  for  some 
item  X.   Note  that  due  to  the  sequential  nature  of  the  SP  and  SD  systems, 
there  can  be  at  most  one  edge  emanating  from  a  particular  transaction  T 
at  any  particular  point  in  time.   A  transaction  T  is  blocked   if  there 

is  an  edge  emanating  from  it,  else  it  is  considered  to  be  free.  New  (or 
restarted)  transactions  enter  the  system  as  free  transactions.  Whenever 
a  new  edge  is  added  to  the  graph,  it  must  emanate  from  a  previously  free 
transaction  (since  they  are  the  only  ones  which  can  gener-ate  lock  requests) 

At  any  point  in  time  (deadlocks  apart) ,  the  wfg  is  a  forest  of  trees 
whose  tree  roots  are  the  free  transactions.   The  total  number  of  nodes  of 
the  wfg  is  always  N  (closed  system) ,  and  the  number  (n)  of  trees  may 
vary  from  1  to  N.   (See  also  [ShS,  81]-. 

We  can  easily  define  a  discrete  time  Markov  process  describing  the 

system's  evolution.   Let  the  srate  of  the  system  be  (s, ,  ...,  s  ,k)  where 
■'  -In 
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A  A 

n  =  number  of  free  transactions  at  time  t,  and   s.  =  size  of  tree   i   at  t,  . 

k       1  k 

At  each  discrete  time,  one  of  the  free  transactions  is  selected  at  random 

and  makes  a  lock  request  (or  completes  with  prob  G  for  the  SP  system) .   If 

the  lock  is  granted,  the  state  vector  (s  ,...,s  ,k+l)  is  identical  in  the 

1      n 

first  n  components  to  {s, ,...,s  ,k).   Else,  a  conflict  happens  (an  edae 

1      n 

is  formed  from  the  selected  tree  root  to  another  tree  or  to  that  root's 
tree)  .   If  the  conflict  is  not  a  deadlock,  the  tv.'o  trees  merge  into  a  single 
tree  v.'hose  size  equals  the  sum  of  the  two  sizes.   If  the  conflict  is  a  dead- 
lock (when  the  root  conflicts  with  its  own  tree) ,  then  the  deadlock  resolu- 
tion mechanism  instantaneously  restarts  the  root  v;hich  caused  the  deadlock 
and  the  tree  is  broken  into  x  +  1  trees  where  x   is  the  number  of  sons  of 
the  deadlocked  root.  "   ^'"~   \^  w    ' 

A  completion  situation  is  similar  to  a  deadlock  in  terms  of  state 
transitions.   The  distribution  of  x   is  state-dependent  and  one  can  (for 
example)  use  the  number  of  random  partitions  of  the  integer  s.  -  1   (i   is 

the  deadlocked  tree  to  be  broken  (see  [ShS,  81]).   Let  p.  . (t,  )  = 

ID   k 

probability  {the  root  of  the  tree  T.  will  conflict  with  tree  T.  at  t,  ,, 

1  j     k-^l 

given  that  the  root  of  T.  is  selected  to  make  a  request  at  time  t  }. 

Then,  if  L. (t,  )   is  the  number  of  items  locked  bv  all  the  transactions  in 

the  tree  T.  at  t,  ,  we  can  estimate  p.  .  (t,  )   to  be 
3     k  /i:   k 

L.(t  ) 

Pij  V  = -^TT^ 

The  total  probability  of  conflict  at  t    will  then  be 
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or 


n    n 

a    =      I       I    -  •   (i-e)   •  Pi.  (t  ) 
i=l  j=l  ^^      *^ 


or      a  =  (1-6)  J  •  L(t.  )  (Eq.  6.1) 

where   ^(''t,)  ~  nuinber  of  locked  items  at  instant  t  . 

Recalling  our  results  of  Section  5  (and  simulation) ,  we  may  approxi- 
mate I'C'U.)  by  its  mean  value  and  get 

a  ^      (1-6)  J-  •  ni(N,M)  (Eq.  6.2) 

^^ 
v:here  Hi(N,M)  =m(1-{1-— 1   )  and  t*   is  given  by  Theorems  5.1  or 

5.3.   Equation  (6.2)  gives  us  the  mean  rate  of  conflicts  in  the  system. 

The  mean  rate  of  successful  requests  will  tjien  be 

,t*> 


rs 


=  1  -  a  ^     1  -  (1-6) fl  -  (l  -  ^)  )        (Eq.  6.3) 


where  t*   is  as  in  Theorem  5.3  or  5.1.   Note  that  (rs * totlocks)   is  a 

theoretic  estimation  of  the  ns  in  the  experiments.  We  found  that 

(rs  *  totlocks)   is  a  close  upper  bound  to  ns   for  both  SD  and  SP. 
The  total  probability  of  deadlock  at  t^.,  is 

n   ^ 

P^(t,  )   =   I       -  •  p..  (t,  )  •  (1-6) 
k      >,   n    11  k 

.  1=1 

=  ^-^^  — ^  •  (Eq.  6.4) 

n      M 

By  comparing  Equations  6.4  and  6.1  we  conclude  that 
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Leirana  6.1.   The  probability  of  a  deadlock  at  a  state  where  n  free 
transaction  exist  is  equal  to  a./n,   where  a  is  the  probability  of  conflict 
at  the  same  state.    The  average  deadlock  rate  is  equal  to 

N 


n=l 


d   „4i      n 


where  f(n)  =  probability  that  the  number  of  free  transactions  in  the  system 
(steady  state)  is  n. 

We  need  to  estimate  f(n).   Let  cut  < o°  and  consider  the  following 
Markov  process  of  continuous  time.   The  state  variable  is  the  number  of  free 
transactions  in  the  system  (n) .   We  consider  the  following  state  transitions 
and  rates.  •  ■  *£^  '_;^ 

n^n-l  with  rate  y   =  (l-9)a"  (1 — ),  i.e.  a  conflict  which  is  not  a  deadlock. 

n  n 

n^n+cut  with  rate      X      =  6   +    (1-G)—      (i.e.    either   a   completion,    or  a  deadlock, 

n  n 

we  use     6   =  — —  )  . 

cut 

This  simple  process  is  expected  to  capture  the  effect  of  deadlocks  and 
conflicts  on  the  mean  number  of  free  transactions.   VJe  should  remark  that  the 

above  process  describes  a  buZk  aJWivaJi  i,iji,tQm     Klei ,  76  with  constant  bulk  size 

a  1 

b  =  cut,  with  arrival  rate  X   =  G  +  (1-0)  —  and  completion  rate  y^  =  a* (1-9)  (l-~) 

The  above  process  is  equivalent  to  an  Erlangian  System  with  cut  =  number  of  stages 

and  with  state  dependent  rates.   The  steady  state  density  of  n  can  be  shown  to 

be  a  sum  of  geometries 

cut 

(Eq.  6.6)     f(K)  =  D(n=K)  =  -^  I      (c.)~^    K  =  1,2,. ..,N 

G   .^^    1 

where  G  is  a  normalizing  constant  depending  on  cut,  N,  9  and  a  , 

I 

p  =  — ,  where   X' ,  y'   are  the  mean 

arrival  and  departure  rates  and  the  c • ' s  are  constants  (depending  on  p  and  cut). 
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positive  and  less  than  1.   (Proof  omitted.   See   [Klei,76]   pp  129-130). 

Equation  6.6  implies  that  the  mean  value  for  n  is  a  very  slowly  growing  function 

cut 
(1-D)    r   ' 
of  N  (bounded  by  I  )    much  smaller  than  N.   All  our  simulations 

i=l  *  1 

validate  this  point  (see   Section  4  and  the  Appendix) . 

It  is  easy  to  remark  that  an  f(n)  which  has  high  values  only  on  small  n 

gives  the  worst  case  rate  of  deadlock.   Hence,  if  we  replace  cut  with  cut=l  in 

the  above  Markov  process,  we  decrease  the  births  and  we  will  get  an   f (n) 

concentrated  on  small   n  and  hence  a  pessimistic  r   .   The  modified  process 

with  cut  =  1  is  a  generalized  birth-death  process  with  solution 

A  'X  •  ...  X 

f  (K)  =  f  (1)  -^-^ ^-i 

m      m    y^'^a'  •  ••  ^K 

and 


f  (1)  = ^ r- 

m  N    A  •  . . .A„  , 

1  +  I       (J: JSzi 

k=2   ^2  ■• •  \ 


) 


Hence 


N 

1 
n=l 


IN 

r,  <   y   f  (n)  -  (Eq.  6.7) 

d  —  ^,   m    n 


Formula  (6.7)  theoretically  justifies  the  slow  (almost  logarithmic) 

increase  of  the  rate  of  deadlocks  with  N.   (Note  a  <_  1)  .   VJe  have  used 

(r  *  totlocks)  as  a  theoretic  estimate  of  n  .   It  is  always  an  upper  bound 
d  cl 

of  n^  for  both  the  SD  and  the  SP  systems, 
d 
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6.3.   Mean  Nmiber  of  Free  TransactioriS  and  Deadlock  Rate 

We  examine  how  the  mean  number  of  free  transactions  relates  to  the 

average  rate  of  deadlock;  we  consider  cut  = "»  for  the  SP  and  SD  variations. 

Let  n  =  mean  number  of  free  transactions  at  time  t   (n  =  N)   and  let 
t  .  o 

d  =  number  of  deadlocks  at  time  t.   Let  a  =  a(M,M)   be  the  probability 
of  conflict  (as  in  Eq.  6.2).   Let  us  also  assume   that,  when  a  deadlock 
happens,  a  nvimber  c  of  new  free  transactions  is  added  to  n   and  that 
the  mean  value  of  c  is  a  function  of  N  and  M  not  depending  on  t. 
Then, 

n^^^  =  (n^+c)  ^  +  (n^-1)  a  •  f  1  -  ^j  +  n^(l-a)       (Eq.  6.7) 
^  it 


—  J..1 
a    ■:  ■ 


(since   probability (deadlock)  = 

"t 


probability (success)   =  1  -  a 
probability (conflict  but  no  deadlock)  = 


Also,  d^_^^  "  '^t  "^  iT  ^^^3-  6.8) 


a_ 
^t 

Equation  6.7  implies       n    =  n  -  a  +  —  (c+1)  (Eq.  6.8) 

by  elementary  manipulation. 

t-1 

Equation  6.8  implies       d  =  a  J   —  (Ea.  6  9) 

t     .'^   n.  \   ^  ' 

1=0   1 

The  mean  deadlock  rate  is  defined  to  be  equal  to 

^t 

r^  =  Ixm  —  •      (Eq.  6.10) 
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From  Eq.    6.3  we  get, 


a  1+1  A     +     ^ 


n. 


c  +  1  c  +  1 


So,    by  Eq.    6.9, 


°t~       c+1  c+1 


,  dt  t         o      ,  a 

and  — —    ~     - — : — rr-     + 


t  f (c+1)  c+1 


Since       n     <  N       we  get. 


d 


Corollary:   The  mean  deadlock  rate  (v/hen  cut  =  <=>)  is  proportional  to 
the  ratio  of  the  mean  probaiiility  of  conflict  divided  by  the  sum  of  one 
and  the  mean  number  of  transactions  freed  in  each  deadlock. 
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7.   CONCLUSIONS  ANT)  FUTURE  WORK 

VJe  aimed  at  formulating  a  methodology   for  analyzing  CCMs.   We  followed 
the  traditional  paradigms  of  experimental  sciences.   Experiments  were  per- 
formed in  a  controlled  fashion  to  obtain  observation  data  and  gain 
intuition  concerning  CCMs'  behavior.   In  those  (reproducible)  experiments, 
we  discovered  that  2PL  behaves  in  a  "physical"  fashion. 

A  model  was  constructed  and  solved  analytically;  it  successfully 
explained  trends  regarding  the  number  of  locked  items  in  the  system.   The 
model  is  based  on  the  classical  occupancy  problem  with  balls  representing 
transactions  and  urns  representing  dati'^itfems.   We  have  also  provided 
analytic  modelling  of  the  second  order  effects  of  deadlocks,  and  of  the 
mean  number  of  free  transactions  in  the  system. 

The  next  step  is  to  extend  the  modelling  effort  in  order  to  explain 
more  of  the  observations.   Only  once  this  is  done,  and  the  behavior  of  the 
dominant  2PL  system  "variables"  is  understood,  will  we  feel  ready  to 
tackle  the  important  issues  of: 

(a)  throughput, 

(b)  response  time, 

(c)  influence  of  xead/write  mix  on  system  behavior, 

(d)  other  lock  request  distributions, 

(e)  locality  of  requests, 

(f)  extensions  to  distributed  databases. 
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Al.l 
Appendix  I , 


Mean  Value  Analysis  of  3U  and  BU(6) 


We 


begin  by  proving  Theorem  5.1.  Let  b  =  mean  number  of  filled 
urns  at  time  t+1  and  r  =  remaining  balls  outside  of  urns  at  time  t +  1 
(mean).   Then  the  following  recurrence,  relations  hold: 

b. 


(^  ■  ^ 


Vi  =  \  •-!* 'V"  I' ■-■■:' 1  '^- " 


=  'tG-^)*'V»T  <^-'' 


t+l 

since,  the  probability  that,  at  time  t„  a  ball  will  hit  an  empty  box,  is 
equal  to  b  /M. 

(Eq.  1)   implies      b^^^  =  b^^l  -  -^j  +  ^ 

(Eq.  2)   implies      r^_^^  "  ^t  "  IT 

Since  b  =  0  and  r  =  N,   we  get 
o  o 

b^  =  l     and     b^  =  (i-i)'        ^(l-^)''       +...   +1    , 
implying         b^  =  Jl  "   (l  "  ^   )  ^^^'    ^^ 


t-1 

and        r^  =  N  -  f7   ^  b,  ,   implying 

i=l 


t  =  ^^  -  I  Ji  ""i  '    ^'"''^'''^ 

r^  =  N  -  t  +  m(i  -  (l  -  i)"")  (Eq.  4) 

Let  t*  be  the  time  at  which  BU  stops.   Then  r^^  =  0  and  thus 
t*  =  N  +  m(i  -  (l  -  ^)   ) 

and         b  ^  =  ni  =  n(l   ~  (■'■  "  m)   ]     °^  Equation  3. 


Al,2 


Note  that  N  <  t*  <  N  +  M. 

For  the  proof  of  Theorem  5.3,  let  b^,  r^  be  defined  as  above  and  let 
e  =  probability  of  completion.  Again,  b^  =  0,   r^  =  N  and 


But  now. 


-t.i  =  ^v^4^  *  (^  -  if)  •  ^]  ^  ^t  •  ^'H'  -  if)  (^^-  ^^ 


Again  Equation  (a)  implies  that 


b^  =  M  (i  -  (i  -  i)  );i  -  i|;---:)  •'      .         (Eq.  c) 


However,  now  -  .  ;  .  ' 

\ 

-t+1  =  ^t  -  IT  (^-9)  -  ® 

implying 

1     e    ^"■'• 

r,  =  n  -  e   and   r^  =  N —   7  b.  -  t  ♦  9 

1  t        M    .^,   1 

1=1 

By  using  the  expression  for  b.,   i  <  t,  we  get  (by  elementary  calculations) 


r  =  N  -  t  + 


a.s,„  (i  -  (i  .  i)') 


leading  to  the  following  expression  for  t*   (such  that  r   =  0) : 

t 

t*  =  N  +  (l-e)M  fl  -  fl  -  ^j   j  (Eq.  d) 

and  hence  by  Equation  c 

(l  -  (l  -  ^)'  )  (Eq.  e) 


ni  =  M 


Equations  (d)  and  (e)  prove  Theorem  5.3. 

Note  again  that  N  <  t*  <  N  +  (1-9)M  for  BU<e)'s  termination  time. 


A1.3 


Uniqueness  of  t*  >  0 
Next,  we  prove  Theorems  5.2  and  5.4.   We  prove  that  the  equation 

t  =  N  +  (1-9)  mL  -  fl  -  ^j  J  =  f  (t) 

(N,M  >  0  ,        e  e   tO,l]) 

has  a  unique  positive  solution.   This  will  prove  both  Theorems  5.4  and 
5.2   (6  =  0). 

Consider  first  the  sequence  t  =  N  +  (1-6)M  and   t.  .  =  f(t.). 

^  o  1+1        1 

Clearly,   t  <  t   and  t  <  N  +  (1-6) M.   Assume  (inductively)  that 

t.  <  t.  ,   and  t.  <  N  +  (1-6 )M.   We  have 
1    1-1        1 


t.  ,  =  N  + 
1+1 


implying 


(i-e)M[i  -  (i  -  i)"' 

.  =  N  +  (1=6)m[i  -  (l  -  I)  "~\ 


N  +  (1-8) M  -  t.  ,      /    ,N  i   i-1 


N 


•  ^^-"^"  "  ^i+1   _  (  1\ 

+  (i-e)M  -  t.  \    ~  Mj 


1 


since   0  <  1  <  1   and   t.  <  t.  ^    we  get 

M  1    1-1 


N  +  (l-e)M  -  t 
hl±     >  1 

N  +  (1-6) M  -  t. 

1 


implying  that 


t.  ,  <  t.  <  N  +  (1-6)M 
1+1'    1 


So,  {t  }  is  strictly  decreasing  and  bounded  below  (by  N) ;  hence  it  converges, 
i 

In  fact,  any  sequence  of   t°  =  a  >  0  and  t.  =  f (t^_^)   will  converge  (same 

a. 
proof),  to  a  nonzero  number.   We  still  have  to  prove  that  all   {t^}   will 

converge  to  the  same  (nonzero)  limit. 


A1.4 


•-Let     A  =  t?    ,        B  =  t. 
-'.1  1 


a,   3  >  0 


and  a""  =  t^^^  =  f  (t°)    ,  b""  =  t^^^  =  f  (t^) 


Let 


A.    =    1A-B| 


^,,=  |A^-B^|  =  (i-e)Ml(i-i)^-(i-0 


=    (1 


-e)M  (i  -  i) 


\niax(A,B) 


-  (^  -  ^f  i 


f  >,max(A,B) 

<  (i-e)M  (i  -  -1  •  A.  .  - 


<  (1 


\max(A,B) 


Hence 


A.^.   <  A    (1 
1+1  o 


-e)^(i  - 


=  g(i) 


But  the  sequence  g(i)  ->  0  as   i  -*■  ".   Hence  A.  -»•  0  as  i  ->■  ".   So  both 
{t.},   {t.}  converge  to  the  same  positive  limit.   Hence  t  =  f  (t)   has  a 


unique  positive  solution. 
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Table  ^-l-  N=10  M=320  Variation=PP 

Cut=100  Totlocks=5000 

Report   every  100   requests 
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Table  A. 2 
Example  of  comparison  of  model  and  simulation 


Mean  Niimber 

of  Urns 

Occupied  (B 

tJ)  -^  Theoretical 

10 

20 

40 

80 

160 

320 

10 

8 

9 

10 

10 

10 

10 

20 

14 

17 

19 

20 

20 

20 

40 

22 

28 

34 

38 

39 

40 

80 

34 

44 

56 

67 

76 

79 

160 

50 

68 

88 
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320 

24 
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(b) 


System  SD,    cut  =   100 
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APPENDIX  III. 
Simulation  Results 
In  all  tables,  the  horizontal  axis  is  transactions  (N)  and  the 
vertical  axis  is  items  (M) ;  Q  =   1/cut.   The  averages  are  on  5000  -  20000 
lock  requests  and  27  controlled  experiments.   The  reporting  period  in 
each  experiment  was  100  lock  requests. 
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A.l.   Averages  for  measured  quantities  of 

the  standard  probabilistic  (SP)  system. 
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A. 2  Data  and  trends  for  the  parallel  deterministic  (PD)  system. 

(  a)     cut  =  100 

No  completions  were  recorded. 

ni :   increases  from  log  (N/M)  to  M. 

nd:   proportional  to  log  (N/M) . 

ns:   increasing  almost  like  -log  (N/M)  and  then  flattening. 

mxla:   proportional  to  log  (N/M) . 

nw:   proportional  to  N  (and  very  large) . 
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Figure  8.1 
Items  locked  vs  log  N 
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(b)   cut  =  4 


ni :   proportional  to  N 

nd:   proportional  to  log  N/M 

mxla:  negligible 

ns:   no  conclusion 

nw:   proportional  to  N 
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number  of  completions  after  threshold:   proportional  to  log  N/M 


ns 


cut  =  4 
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14 
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15 
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55 
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13 

42 

116 

.  285 
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31 

98 

256 
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160 
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18 

71 

206 

543 

320 
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39 

143 

443 

A-20 


Figure  8.4.   Number  of  locked  items  (ni)  versus  N 


A-21 
(c)   cut  =  16 
Only  a  few  completions.   Very  similar  to  cut  =  100. 
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13 
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18 

19 
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10 

13 

15 

18 

320 

2 
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12 

15 
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(d)   cut  =  8 


ni:  slower  than  linear  to  N 

nd:  proportional  to  log  N/M 

ns :  no  remarks 

mxla:  decreases  with  N 

1092(36) 
nv;:    proportional  to  N         for  6£[1,2] 

number  of  completions:   proportional  to   log (N/M)  (after  a  threshold) 
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15 
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11 
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17 

320 
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0 
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nw 


10 


20 


40 
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320 
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10 

29 

60 

135 
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24 

61 

136 
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80 
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281 
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160 
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14 

48 
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314 

589 

320 

2 
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37 

116 

306 
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A. 3  The  Analytic  Model  of  Balls  and  Urns  for  SP. 
All  tables  refer  to  nw. 
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9   =    1/100 


10  20  40  80  160  320 


10 

8 
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10 
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20 

14 

17 

19 
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80 

34 

44 

56 

67 
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50 

68 

88 
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320 

74 

101 

135 
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e  =  1/16 
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5 
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20 

40 

80 

160 

320 

10 
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20 

11 

14 

17 

IS 

20  -' 

20 

20 

40 

16 

21 

27 

33 

38 

40 

40 

80 

22 

31 

42 

55 

67 

.;    76 

79 
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30 
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109 
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87 

123 
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218 
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6  =  1/12 
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40 

80 
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320 
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27 

33 

38 
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21 

30 

41 

54 

66 

76 

79 
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28 
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60 

82 
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150 
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83 
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215 
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9  =  1/8 

5 
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20 

40 

80 
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320 

10 
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10 

10 

10 

10 

20 

10 

13 

17 

19 

20 

20 

20 

40 

14 

20 

2.7 

33 

38 

40 

40 

80 

19 

28 

40 

53 

66 
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79 
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24 

38 

56 

79 

106 

131 

151 
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29 

48 
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130 
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211 
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e  =  1/6 
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24 
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69 
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153 
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260 

G  = 

1/4 

5 

10 

20 

40 

80 

160 
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10 
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20 
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13 

16 
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20 

40 

12 

18 

25 

32 

37 

40 

40 

80 

14 

23 

35 

50 

64 

75 

79 

160 

17 

29 

47 

71 

99 

128 

149 

320 

18 

33 

57 

93 

141 

198 

255 

G  = 

1/2 

5 

10 

20 

40 

80 

160 

320 

10 

5 

8 

9 

10 

10 

10 

10 

20 
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11 

15 

18 

20 

20 

20 

40 
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